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Atoms and molecules are important conceptual entities we invented to understand the physical world around 
us. The key to their usefulness lies in the organization of nuclear and electronic degrees of freedom into a 
single dynamical variable whose time evolution we can better imagine. The use of such effective variables 
in place of the true microscopic variables is possible because of the separation between nuclear time scales 
(very fast), electronic time scales (fast), atomic time scales (slow), and molecular time scales (slower still). 
Where separation of time scales occurs, identification of analogous objects in complex systems — an example 
of which is the financial market — can help advance our understanding of their dynamics. To detect separated 
time scales and identify their associated effective degrees of freedom in financial markets, we devised a two- 
stage statistical clustering scheme to analyze the price movements of stocks in several equity markets. The price 
movements are first clustered at a short time scale, and thereafter, re-clustered based on separated correlation 
levels that become evident when these short- time correlations are examined over a longer time scale. Through 
this two-time-scale clustering analysis, we discovered a hierarchy of levels of self-organization in real financial 
markets, whereby lower level, rapidly-evolving dynamical structures are nested within higher level, slowly- 
evolving dynamical structures, which are themselves nested within even higher level, even more slowly-evolving 
dynamical structures. We call these self-organized dynamical structures (which are statistically robust) financial 
atoms, financial molecules, znd financial supermolecules. While many financial atoms have compositions that 
on hindsight appear trivial or easy to guess, there are also those whose composition are genuine surprises. With 
larger and better-regulated markets, the component stocks of financial molecules and financial supermolecules 
all fall within a single market sector, or several very closely-related market sectors. In markets with extensive 
cross ownership, or dominated by huge multi-industry holding companies, financial molecules and financial 
supermolecules straddling many distant market sectors were found. In general, the detailed compositions of 
these dynamical structures cannot be deduced based on raw financial intuition alone, and must be explained in 
terms of the underlying portfolios, and investment strategies of market players. More interestingly, we find that 
major market events such as the Chinese Correction and the Subprime Crisis leave many tell-tale signs within 
the correlational structures of financial molecules. 

PACS numbers: 05.45.Tp, 87.23.Ge, 89.65.Gh, 89.75.Fb 



I. INTRODUCTION 

Predicting the movement of prices in financial markets is an 
important problem, more so in the light of the global financial 
crisis we find ourselves in right now. If we have the means to 
detect market undercurrents that do not have clear signatures 
on the surface, we might be able to steer clear of trouble, and 
perhaps even defuse financial landmines before things get out 
of hand. However, there are at present no reliable schemes for 
short- or long-term predictions, despite the fact that past infor- 
mation on all financial instruments are wholly available 
This stands in stark contrast to our ability to predict solar and 
lunar eclipses to very high precision, starting from Newton's 
Laws of Motion, and much less voluminous initial data. We 
believe this is due to the fact that financial markets are com- 
plex systems, whose dynamics follow self-organizing princi- 
ples we do not yet understand. Clearly, an important first step 
towards writing down any predictive models of financial mar- 
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ket dynamics would be to identify the important degrees of 
freedom. A priori, we have no clue whether these are indi- 
vidual financial instruments, or specific collections of instru- 
ments. We hence look to the physical sciences for inspiration. 

A. Statistical Learning in the Pliysical Sciences 

In the physical sciences, we talk about atoms and molecules 
as if they are real entities. Quantum-mechanically, all atomic 
and molecular systems must be described by a single many- 
body wave function *F(ri, . . . , r^, t) whose dynamics is gov- 
erned by the Schrodinger equation ihidV jdt) - }r¥. If we 
admit that the wave function ^{v\, . . ., r^, f) furnishes a suffi- 
cient description of the problem, then atoms have no real exis- 
tence inside a molecule, and molecules have no real existence 
within an interacting collection of molecules. Nevertheless, it 
is very useful to continue speaking of them, because they sim- 
plify our mental models of the processes that take place within 
the collection of nucleons and electrons. This is admissi- 
ble, and in fact physically meaningful, because the dynamical 
structures that atoms and molecules participate in evolve over 
time scales significantly longer than the intrinsic time scales 
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set by nuclear and electronic motion, i.e. there is a separation 
between nuclear/electronic time scales and atomic/molecular 
time scales. 

To better illustrate how our association of certain collec- 
tions of nuclear and electronic degrees of freedom with atoms, 
and other collections of nuclear and electronic degrees of free- 
dom with molecules arise naturally from this separation of 
time scales, let us imagine a universe containing a single wa- 
ter molecule 'H2^^0, consisting of ten protons, eight neutrons, 
and ten electrons. Under ordinary conditions, the nucle- 
ons and the two 'H protons are always very well separated 
spatially, so in principle we have no problem telling them 
apart. For the ten indistinguishable electrons, however, we 
cannot simply assign eight labeled electrons to "'O, and one 
labeled electron each to the two 'H. Nevertheless, if we make 
a movie of the time evolution of electronic densities within the 
water molecule, we will find an electronic distribution close 
to '^O nucleus that evolves pretty much independently of the 
electronic distributions between the '^O nucleus and 'H nu- 
clei. We think of the former as the density of ^^O core elec- 
trons, and the latter as the density of the bonding electrons. 
Unlike the '^O core electrons, the bonding electrons cannot 
be associated with either the "'O atom or the 'H atom alone. 
Instead, they must be associated with the two chemical bonds 
— another conceptual tool we invent to help us understand the 
nature of stable bound states between atomic species — in the 
water molecule. 

Now let us imagine actually producing such a movie, 
through direct integration of the time-dependent Schrodinger 
equation for the water molecule. If we assume that the nucle- 
ons are static, we can solve this problem in the basis of all pos- 
sible 10-electron configurations. Some of these configurations 
will contribute predominantly to the core density, while others 
will contribute predominantly to the bond density. Because of 
the nucleon-electron and electron-electron Coulomb interac- 
tions, the quantum-mechanical amplitudes of these configura- 
tions will evolve with time, sometimes rapidly, and sometimes 
slowly. As a result of these variable rates of time evolution, if 
we examine the configurations contributing predominantly to 
the core electronic density, over a time interval that is on 
the order of the electronic time scale, we might find their am- 
plitudes to be sometimes correlated, and sometimes uncorre- 
cted. In fact, over electronic time scales, some core electronic 
amplitudes might even be more correlated with bonding elec- 
tronic amplitudes than with other core electronic amplitudes. 
Nevertheless, based on our intuitive picture of distinguishable 
core and bonding electronic densities, and the separation of 
electronic and atomic/molecular time scales, we expect the 
long-run averages of core-core short-time correlations, as well 
as the long-run averages of bond-bond short-time correlations, 
to tend towards some nonzero values, while the long-run av- 
erages of core-bonding correlations tend towards zero. 



B. Atoms and Molecules as Robust Dynamical Structures 

The understanding that emerges from this dynamical pic- 
ture is that core and bond amplitudes should not be treated as 



objects with immutable labels. Instead, they ought to be iden- 
tified as clusters emerging from a statistical learning proce- 
dure, where we repeatedly calculate correlations at the elec- 
tronic time scale (the short time scale), and examine how 
the patterns of short-time correlations look like over a much 
longer duration (the long time scale). Given a collection of 
unlabeled time-dependent amplitudes, we can always make 
use of this procedure of statistical clustering at two differ- 
ent (separated) time scales to discover the compound objects, 
things that we would go on to call atoms and molecules. More 
generally, if we are given a large number of time-dependent 
variables, determining how short-time correlations are them- 
selves correlated over longer times would allow us to deter- 
mine which sets of variables are strongly correlated over long 
time scales. These strongly-correlated set of variables then 
serve as good candidates for efl'ective degrees of freedom at 
the long time scale. 

Ultimately, for dynamical features as robust as the core 
electronic densities and the bonding electronic densities, we 
expect their statistical signatures to be generally insensitive to 
specific choices of the short and long time scales, so long as 
the short time scale is comparable to electronic time scales, 
and the long time scale is comparable to the atomic/molecular 
time scale. In addition, it should not matter what short-time 
correlations we evaluate, and what long-time correlations of 
these we further determine. For some choices of short-time 
correlations, statistical signatures of the core and bond clus- 
ters of amplitudes might emerge fairly quickly during the 
long-time clustering. For other choices of short-time corre- 
lations, we might have to cluster over a longer time to detect 
these statistical signatures. Recast as a learning problem, the 
extraction of robust, and therefore physically meaningful, dy- 
namical features boils down to a simple problem of statistics 
and statistical significance. The simplest scheme that imple- 
ments the two-time-scale statistical clustering procedure out- 
lined above is based on the sign of the rate of change, as shown 
in Figure [T] In this scheme, two scalar variables x, and Xj are 
correlated over a short-time window if they are both increas- 
ing or decreasing. Otherwise, the two variables are considered 
to be uncorrected. Over the long time scale, we define their 
long-time correlation C(i, j) to be the number of times they 
are correlated with each other at short time scales. These pair- 
wise correlations can be organized into a long-time correlation 
matrix C, which is the starting point for our identification of 
strongly-correlated compound objects. 



II. DATA AND RESULTS 

Armed with this understanding, we analyzed five financial 
markets, the New York Stock Exchange (NYSE), the Lon- 
don Stock Exchange (LSE), the Tokyo Stock Exchange (TSE), 
the Hong Kong Stock Exchange (HKSE), and the Singapore 
Stock Exchange (SGX) over a two-year period from January 
2006 to December 2007 (see TableU). These markets are cho- 
sen specifically for contrast: the NYSE, LSE, and TSE are 
mature markets in G7 countries, whereas the HKSE and SGX 
are emerging markets. Presumably, the dynamics in emerg- 



3 



magnitudes 
short-time windows 



long-time window 




-►time 



short-time clusters 



l-l-l-kl-kl 



+++++-+-+- 



I kkkkkhkhkkhkF 



kkkkkkl 



CU. i) 



X X 

X X 

X 

X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 
long 



X X X X 

X X X X 

X X X X 

X X X X 

X X X 

X X X 

X X X 

X X X 

X X X X 



XXX 
X X X X 
X X X X 
X X X X 

-time correlation matrix 



X X 
X X 
X X 
X 
X 
X 
X 
X 

X 
X 
X 
X 



X 
X 
X 

xlx 



X 

X X 

X X 

X X 

X X 

X 

X 

X X 

X X 



X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X 

X 



C(i.j) 



FIG. 1: Scheme of the two-time-scale statistical clustering method. 
In the top panel, the time series of A' variables are examined within 
a long-time window, which consists of a large number of short-time 
windows. In short-time window f, variable ; is assigned to the 
cluster if Ax,(/) = x,(f + 1) - Xi{t) > 0, and to the '-' cluster if 
Ax,(f) = Xj(t + 1) - x,(f) < 0, as shown in the middle panel. In the 
bottom panel, we compute long-time correlation matrix C, such that 
C(i, j) is the number of times variables i and j are assigned to the 
same short-time cluster. 



ing markets should be different from that in mature markets, 
and thus one might naively expect structurally different self- 
organizations in these two classes of markets. To keep the 
datasets manageable, we restricted our analysis to only stocks 
and stock related securities. Since stocks are important com- 
ponents in any financial market, we believe their dynamics 
alone would offer us valuable insights into the inner workings 
of the financial world. 



TABLE I: The total number of stocks and market sectors monitored 
from January 2006 to December 2007, for each of the five equity 
markets studied. These five markets are chosen to contrast mature 
markets in G7 countries (one each in North America, Europe, and 
Asia), and emerging markets in Asia. 



NYSE 


LSE 


TSE 


HKSE SOX 


Total number of stocks 6743 ' 
Total number of market sectors 10^ 


8820^ 
10^ 


2636^ 
39^ 


1070^ 546' 



'Thomson Datastream, http://www.datastream.com/ 

-Taqtic, SIRCA, https : //taq t ic . sirca . org . au/TaqTic/l 

^ http : //finance . yahoo . com/| [2]. 

'^Hong Kong Stock Exchange, 

jhttp : / /www . hkex . com . hk/inv est/index . asp? | 

[id=company/prof ilemenu_pag e-e . asp| 

'Reuters. 

'SGX - Singapore Exchange Ltd, 

[htt p : // info . sgx . com/webs tocks.nsf/revamp+new+all+| 
|stocks+sector?OpenView] 



A. Financial Atoms and Atomic Correlation Levels 

For each financial market, we looked at the daily price 
movements {Api(t), Ap2(t), . . . , Api(t), . .., Apff(t)} of the 
stocks making up the market. Here, Api(t) - pi{t -i- 1) - Pi(t), 
and pi{t) is the price of stock / on day f. For this choice of 
data frequency, we have effectively selected a short time scale 
of one day. Our long time scale, chosen to ensure statistical 
significance [3], is the two-year observation period, consist- 
ing of 500-1- short time windows. It is useful to keep in mind 
that for all financial markets, there is a microscopic time scale 
set by the average trading interval, which can be anywhere 
between 10"^ s to 10' s. Following the two-time-scale clus- 
tering procedure outlined above, we assign stocks ; and j to 
the same short-time cluster at time t, if Api(t) and Apj(t) are 
of the same sign. A stock k whose price does not change from 
day t to day t + I, i.e. Ap^it) - 0, is left unassigned. In 
the long-time correlation matrix C that emerges from this pro- 
cedure, we find a background correlation level cq reflecting 
market-level drifts, experienced by all stock prices, that re- 
sults from rallies and crashes. This background correlation 
level Co varies from market to market. In general, the larger cq 
is, the more liquid the market. 

To get a rough sense of the dynamical structures having 
long-time correlations over and above market-level drifts, we 
zero matrix elements in C below a threshold c > cq. As c is 
raised, we go from a dense matrix, telling us that each stock is 
correlated with a large number of stocks (though not equally 
strongly), to a sparse matrix, telling us that each stock is cor- 
related only with very few other stocks (but very strongly 
so). Just as surfaces of high constant electronic density fre- 
quently reveal the atomic constituents of a molecule, very 
large matrix elements in C reveal the atomic constituents of 
the given financial market. We call these small clusters of very 
strongly correlated stocks financial atoms. The correlation 
levels within financial atoms rise significantly above the back- 
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ground level cq, because the dynamics of atomic stocks are 
coherent over time scales much longer than the average trad- 
ing interval, which plays the role of the 'nuclear/electronic' 
time scale. The existence of such financial atoms is evident 
from earher studies I4,5,4i&9,iallil[il[il[ll[il[il, 
HIlMllMllSllll], where they are referred to as synchronized 
clusters. However, the separation of 'nuclear/electronic' and 
atomic time scales was not recognized. The potential of using 
financial atoms as effective variables to describe real financial 
markets is also not widely appreciated. 

To properly identify these financial atoms in a given mar- 
ket, we need to be careful: just as core electronic densities are 
different for different atoms, we suspect the correlation lev- 
els may also be different for different financial atoms. This 
suggests the need for multiple thresholds, instead of a sin- 
gle threshold c, if we use agglomerative hierarchical cluster- 
ing to discover the financial atoms. In order for the associ- 
ation between clusters in the hierarchical clustering tree and 
atomic/molecular components in the financial market to be 
statistically robust, we require the clusters to be insensitive 
to variations in the individual thresholds. To accomplish this, 
one can imagine a sophisticated algorithm starting out with a 
large number of thresholds, which are progressively merged as 
they are slowly raised, based on how sensitive the clusters are 
to these thresholds. Such an algorithm, however, is difficult to 
implement, and thus we adopt a simpler approach, where we 
start from a seed cluster, i.e. a core set of two very-strongly- 
correlated stocks, and grow the hierarchical cluster outwards. 
We call this procedure partial hierarchical clustering, and the 
sequence of correlation levels at which new members are ad- 
mitted into the growing cluster its partial hierarchical clus- 
tering history. Figure |2] shows the typical partial hierarchi- 
cal clustering histories obtained using different linkage algo- 
rithms. With all linkage algorithms, the statistical signature to 
look out for is a sharp change in slope of the correlation level 
as a function of cluster size. This can take the form of sharp 
spikes, sharp drops, or simply kinks. These statistical signa- 
tures have different interpretations for the different linkage al- 
gorithms. We pick the complete link algorithm for our analy- 
sis proper, because the sharp drops and kinks in the correlation 
level can be most naturally interpreted as cluster boundaries. 
Compared to the statistical signatures used by most existing 
statistical learning algorithms 11221 1231 1241 1251 l26i l27l |28L l29l 
HmmillllllllllMIl^ lll^ErSS^ETSi , we beneve 
ours is more intuitive. 

Since the total number of financial atoms in a given 
market is not known, we discover them iteratively as fol- 
lows. First, let us use C*^*'* = C to denote our long-time 
correlation matrix, and find the maximum matrix element, 
max,-,yC(»)(/,j) = C(0)(/(i>,/i>). Using {/<'),/'>} = {4'\4"} 
as our first seed cluster for complete-link partial hierarchi- 
cal clustering, we look for the sharpest change in the slope 
of the correlation level as a function of cluster size, which 
marks the 'boundary' of the first candidate financial atom 
(see Figure [3]l. To find the second finan- 
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cial atom, we zero the ^Jj^} rows and columns 

of C*"', so that the partial hierarchical clustering can proceed 
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50 100 
cluster size 

FIG. 2: Typical plots, taken from the SGX, of the correlation level 
as a function of cluster size when we perform partial hierarchical 
clustering using (a) the single link algorithm; (b) the average link 
algorithm; and (c) the complete link algorithm. Spikes in the single- 
link partial hierarchical clustering history arise as the growing cluster 
incorporates a small group of very strongly correlated stocks (finan- 
cial atoms), while sharp drops and kinks in all partial hierarchical 
clustering histories arise when the growing cluster has more or less 
exhausted stocks that are correlated with each other at a certain cor- 
relation level (financial molecules), and starts incorporating stocks 
that correlated with each other at a lower correlation level. 



independently of constituents of the first candidate financial 
atom. We call this modified correlation matrix C*'\ Again, 



we find maxyC*i>(/,;) = C^^\i'-^KP), and use {i 
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as our second seed cluster for complete-link par- 
tial hierarchical clustering. From the sharpest change in the 
slope of the correlation level, we identify the second candi- 



date financial atom {/, , / 
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, / ,1, ), and proceed to zero the 
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;i(2)i rows and columns of C*'' to get C^^'. This 



iterative procedure mimics a full hierarchical clustering hav- 
ing different thresholds, which we can use to find as many 
candidate financial atoms as we like. 

For a candidate to qualify as a true financial atom, the 
constituent atomic stocks must be more strongly correlated 
with each other than they are with stocks outside of the fi- 
nancial atom. This 'self-consistency' criterion defines an up- 
per atomic correlation level, and if we are to adhere to this 
'self-consistency' criterion strictly, only clusters of stocks 




FIG. 3: A financial atom is identified from the complete-link partial 
hierarchical clustering history of a given seed cluster by searching 
for the sharpest change in slope at small cluster sizes. Here, we show 
the Singtel financial atom in the SGX, which consists of three stocks 
(two of which are shown as decorated solid circles). 



with maximum long-time correlation rising above the upper 
atomic correlation level can be considered to have dynami- 
cally self-organized into strong financial atoms. This crite- 
rion is somewhat restrictive, so we looked for an alternative 
criterion compatible with the self-organization observed. In- 
specting the various partial hierarchical clustering histories of 
the most strongly-correlated seed clusters, we find the corre- 
lation levels at which the first non-atomic stock was admit- 
ted into the growing clusters to be tightly clustered around 
a small number of lower atomic correlation levels. We can 
therefore define weak financial atoms to be those candidate fi- 
nancial atoms whose maximum correlations are greater than 
the average of the lower atomic correlation levels. In Table HIl 
we show the upper atom correlation level, the average lower 
atomic correlation level, and the number of financial atoms 
based on the two criterions, in each of the five financial mar- 
kets. The compositions of strong financial atoms found in the 
five equity markets are shown in Appendix A. 

In all markets, we find small financial atoms comprising 
on the order of 10 stocks. In all markets except for the TSE, 
we find trivial financial atoms consisting of different stocks 
issued by the same company, e.g. VIA. A and VIA.B (both of- 
fered by Viacom) in the NYSE, SDR and SDRt (both offered 
by Schroders pic) in the LSE, 0019 and 0087 (both offered 
by Swire Pacific) in the HKSE, and Singtel, Singtel 10, and 
Singtel 100 (all offered by Singtel) in the SGX. In the NYSE, 
LSE and TSE, the constituent stocks of nearly all strong finan- 
cial atoms fall cleanly into the respective market sectors. In 
the LSE, the exception is financial atom 12, which consists of 
mostly insurance stocks (Aviva, Prudential, Legal and General 
Group, Friends Provident), but also contain one bank stock 
(Standard Chartered), and one investment trust stock (Foreign 



TABLE II: The number of strong and weak financial atoms discov- 
ered in the five equity markets. Also shown are the upper atomic 
correlation levels, the average lower atomic correlation levels, and 
the average molecular correlation levels for the five equity markets, 
out of a maximum possible value of 522 over the two-year observa- 
tion period. The maximum intra-cluster correlation of a cluster must 
exceed the upper (lower) atomic correlation level for it to qualify as 
a strong (weak) financial atom respectively. 

NYSE LSE TSE HKSE SGX 

Upper atomic correlation level 420 370 380 340 
Number of strong financial atoms 30 12 14 7 
Lower atomic correlation level 379 355 348 306 
Number of financial atoms 108 22 52 16 

Molecular correlation level 349 310 320 269 
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6 
262 

11 
211 



& Colonial Investment Trust). In the TSE, the exception is 
financial atom 9, which consists of real estate stocks (Tokyo 
Tatemono Co, Urban, Creed Co, Pacific Holding Inc), apart 
from Kenedix Inc, which is classified as belonging to the Ser- 
vices market sector However, Kenedix Inc's four business 
segments are all related to the real estate market. 

In the HKSE and SGX, on the other hand, we find a signif- 
icant proportion of strong financial atoms having constituents 
that straddle several market sectors (financial atoms 1 and 2 
in the HKSE, and financial atoms 4 and 6 in the SGX). The 
emergence of these strong cross-sector correlations can only 
be explained by the underlying shareholder profiles, and con- 
fluential business interests. For example, in the HKSE finan- 
cial atom 1, 40.23% of the stake in Cheung Kong is held 
by Hong Kong business tycoon Li Ka-Shing |43]. Cheung 
Kong, in turn, owns 49.97% of Hutchison Whampoa's shares. 
Similarly, the Hong Kong and Shanghai Banking Corpora- 
tion (HSBC) owns 62.14% of the shares of the Hang Seng 
Bank, and holds a 42.01% net stake in Sun Hung Kai Prop- 
erties. These two pairs of stocks have little in common in 
terms of ownership, and the main reason they are so strongly- 
correlated is probably Cheung Kong and Hutchison Whampoa 
buying over HSBC and Hang Seng's stakes in the e-commerce 
setup iBusiness on July 27, 2007 |44]. The HKSE financial 
atom 1 also contains the MTR Corp, a statutory corporation 
owned by the Hong Kong Government, and China Mobile, a 
state-owned enterprise of the People's Republic of China. The 
exact reasons for these two to be strongly correlated with Che- 
ung Kong-Hutchison Whampoa and Hang Seng-Sun Hung 
Kai Properties are unclear (HSBC has a 0.15% stake in MTR 
Corp, but this is surely too little to account for the observed 
correlations). We believe these strong correlations may be 
related to the perceived competition between China Mobile 
and Hutchison Whampoa, which has a telecommunications 
arm, the announcement on April 13, 2006 that the MTR Corp 
signed an agreement with the Beijing municipal government 
to build and operate a new Beijing rail line |45], or the an- 
nouncement on Nov 28, 2007 that Cheung Kong won a HK$7 
billion MTR property project ll46ll . 

Another interesting observation that we would like to high- 
light, is the finding of more than one strong financial atom 
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for some market sectors in the LSE and TSE. In contrast, we 
find at most one strong financial atom per market sector in 
the NYSE. In the LSE, we find a total of four strong finan- 
cial atoms belonging to the Investment Trusts market sector, 
whereas in the TSE, we find two strong financial atoms each in 
the Real Estate, Banks, and Non-Ferrous Metals market sec- 
tors. Looking at the example of the LSE financial atoms 4 
(Scottish Investment Trust and Witan Investment Trust) and 5 
(Monks Investment Trust and Scottish Mortgage Investment 
Trust), we find that Scottish Investment Trust and Witan In- 
vestment Trust have in common ownership by the French 
holding company AXA, which holds 11.30% and 16.00% 
stakes in the two investment trusts respectively |43]. The 
shareholders they have in common also include the Legal and 
General Gp (4.10% and 4.59% respectively) and Lloyds TSB 
Gp (2.99% and 0.26% respectively). In contrast, AXA is 
missing from the lists of larger shareholders for Monks Invest- 
ment Trust and Scottish Mortgage Investment Trust, which 
feature instead Barclays (5.99% and 3.61% respectively), as 
well as Legal and General Gp (3.80% and 3.60% respec- 
tively). For the example of the TSE financial atoms 5 and 
8, while the ownership makeups tell interesting stories, we 
believe the main reason for their dynamics to be decoupled 
is TSE financial atom 5 being made up entirely of large re- 
gional banks, with operating revenue less than US$1 billion, 
while TSE financial atom 8 consists of even larger diversified 
banks, with operating revenue significantly larger than US$1 
billion. 

Finally, we note that strong foreign financial atoms are de- 
tected in the NYSE and LSE, reflecting their more globalized 
character, but not in the other three markets. In the LSE, the 
only strong foreign financial atom (atom 11) consists of two 
Korean stocks issued by Samsung Electronics. In the NYSE, 
on the other hand, there are a total of 10 strong foreign finan- 
cial atoms (the American Depository Receipts (ADR) atoms). 
In general, foreign financial atoms are smaller compared to 
the typical local financial atoms. Many of them are also part 
of strong financial atoms in their home markets. We shall also 
see later, while LSE atom 1 1 is weakly correlated with the rest 
of LSE, that the dynamics of some of the NYSE ADR atoms 
are strongly coupled to that of many US atoms in the NYSE. 



out for sharp changes in the slope of the correlation level as 
a function of cluster size (see Figure |4]l. These occur when- 
ever a financial atom from a different financial molecule is 
added to the growing cluster by the partial hierarchical clus- 
tering procedure. Going over the partial hierarchical cluster- 
ing histories of all strong financial atoms, we find the correla- 
tion levels at which the first non-molecular atom was admit- 
ted into the growing clusters to be tightly clustered around a 
small number of molecular correlation levels. To illustrate the 
separation of atomic and molecular correlation levels that re- 
sult from the separation atomic and molecular time scales, we 
show the average molecular correlation level in Table [III for 
easy comparison against the atomic correlation levels. 




B. Financial Molecules and Molecular Correlation Level 

From Figure |2l we see statistical signatures indicating the 
presence of higher level self-organization in real financial 
markets. We refer to self-organized structures one level higher 
than financial atoms as financial molecules. Just as molecules 
are made up of atoms, we expect financial molecules to also 
be composed of financial atoms. Again, we intuitively ex- 
pect financial atoms in the same financial molecule to be more 
strongly correlated with each other, compared to financial 
atoms in different financial molecules. To find these entities, 
we perform complete-link partial hierarchical clustering of the 
fiill correlation matrix — so that all financial atoms will be 
considered — starting from a given financial atom. We then 
examine the partial hierarchical clustering history, and look 



FIG. 4: Complete-link partial hierarchical clustering histories start- 
ing from (top to bottom): financial atom 21 of the NYSE, financial 
atom 5 of the LSE, financial atom 7 of the TSE, financial atom 4 of 
the HKSE, and financial atom 5 of the SGX. In these plots, the most 
distinctive changes in slope of the correlation level, which mark nat- 
ural boundaries of financial molecules, are indicated by the black 
triangles. 

Using sharp changes in the slope of the correlation level- 
cluster size plots as natural markers for molecular boundaries, 
we can determine the collections of stocks making up vari- 
ous financial molecules in a given market. A molecular col- 
lection of stocks so identified represents part or whole of a 
financial molecule. In general, we need to examine several 
such molecular fragments to deduce the complete microscopic 
makeup of the financial molecule. However, we find this 
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stock-level composition of the financial molecule less useful 
than its atom-level composition. If one or more stocks from a 
financial atom is found in any molecular fragment of a finan- 
cial molecule, we say that the financial atom is a constituent 
atom of the financial molecule. We call the list of constituent 
atoms, and their pattern of correlations, the molecular struc- 
ture of a given financial molecule. Within this atomic cari- 
cature of the financial molecule, the molecular fragments are 
groups of constituent atoms that are more strongly correlated 
with each other than they are with other constituent atoms. As 
we shall illustrate below using the SGX financial molecule as 
example, the molecular fragments that we obtained starting 
from diff'erent constituent atoms, plus and minus some non- 
atomic stocks, are consistent with the overall correlational 
structure of the financial molecule. This tells us that financial 
molecules are robust self-organized features in the dynamics 
of real financial markets. 

Within a financial molecule, all constituent atoms are cor- 
related with each other, but more strongly with some, and less 
strongly with others. To produce an intuitive picture of the 
correlational structure within the financial molecule, we draw 
bonds between financial atoms that are strongly correlated. 
There is no unique way to do this, but we find the following 
rules producing rather informative diagrams: 

1 . area of a financial atom is proportional to the number of 
stocks it contains; 

2. solid circle for strong financial atoms, and dashed circle 
for weak financial atoms; 

3. select a upper correlation level cy and lower correlation 
level C2 based on the intra-molecular correlations ll47ll . 
and draw: 

(a) a thick bond between two constituent atoms if 
their maximum and minimum interatomic corre- 
lation exceeds c\ and C2 respectively; 

(b) a thin bond between two constituent atoms if their 
maximum interatomic correlation exceeds c\, but 
their minimum interatomic correlation is below 

(c) a dashed bond between two constituent atoms if 
their maximum interatomic correlation is between 
c\ and C2; 

(d) no bond between two constituent atoms if their 
maximum interatomic correlation is below ca; 

4. if a constituent atom is bonded to other constituent 
atoms by solid as well as dashed bonds, remove the 
dashed bonds; 

5. shade a constituent atom light gray, if its correlations 
with non-atomic components exceed C2. 

I. Financial Molecules in the SGX and HKSE 

In the SGX, we find only one financial molecule, whose 
composition depends on the level of statistical significance 
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Code Name 


Market Sector 


Atom 7 


BR9 


China Hongxing Sports 


Manufacturing 


P39 


Pine Agritech 


Manufacturing 


Atom 8 


F83 


Cosco Corp (Singapore) 


TSC 


N03 


Neptune Orient Lines 


TSC 


A04 


ASL Marine Holdings 


Manufacturing 


S68 


Singapore Exchange 


Finance 


5GJ 


AusGroup 


Manufacturing 


SEE 


Aqua-Terra Supply Co 


Services 


F12 


Fibrechem Technologies 


Manufacturing 


R07 


Rotary Engineering 


Construction 


U14 


UOLGp 


Property 


T03 


Tat Hong Holdings 


Commerce 


Non-atomic Stocks 


A61 


AsiaPharm Group 


Manufacturing 


B20 


Biosensors International Gp 


Manufacturing 


D79 


C&G Industrial Holdings 


Manufacturing 


C47 


China Aviation Oil (Singapore) Commerce 




Corp 




G86 


China Milk Products Gp 


Commerce 


E94 


Lizhong Wheel Gp 


Manufacturing 


SEN 


Midas Holdings 


Manufacturing 



FIG. 5: The six-atom SGX financial molecule comprising the strong 
financial atoms 3, 4, 5 and 6, as well as the weak financial atoms 7 
and 8 (see table above for compositions). This is deduced from the 
first natural boundaries in the partial hierarchical clustering histories 
of the strong financial atoms 3, 4, 5, and 6. The bonds are drawn with 
£■[ = 280 and C2 = 262. In the table above, TSC is an acronym for 
the transportation, storage, and communications market sector. 



we desire. When we look at the partial hierarchical cluster- 
ing histories starting from the strong financial atoms 3, 4, or 
5, we find two natural molecular boundaries (see bottommost 
panel in Figure|4]l. Depending on the starting financial atom, 
the statistically less significant first boundary suggests differ- 
ent lists of constituent atoms: (3, 4, 5, 6, 7, 8) when we start 
from strong financial atoms 3 and 5, and (3, 4, 7, 8) when 
we start from strong financial atom 4. From the partial hi- 
erarchical clustering history of strong financial atom 6, we 
identify only one natural molecular boundary, and list of con- 
stituent atoms suggested by this sole molecular boundary is 
(5, 6). Based on these observations, we conclude that the par- 
tial hierarchical clustering procedure finds different parts of 
the financial molecule if we start from different constituent 
atoms. These different parts, however, are consistent with the 
molecular structure of the six-atom financial molecule shown 
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in Figure |5] which we deduce by taking the union of the dif- 
ferent lists of constituent atoms, and drawing bonds based on 
the rules listed above. Different starting constituent atoms also 
give us different lists of constituent non-atomic stocks. Again, 
we take the union of these lists, and find that the constituent 
non-atomic stocks are most strongly correlated with the finan- 
cial atoms 3, 4, and 8. Because their strong sub-atomic cor- 
relations with multiple financial atoms, we can interpret these 
constituent non-atomic stocks as 'bonding' stocks. 
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Code 


Name 


Market Sector 


Atom 9 


L28 


Longcheer Holdings 


Services 


L46 


Luzhou Bio-chem Technology 


Manufacturing 


Atom 10 


F20 


Federal International (2000) 


Commerce 


T06 


Tiong Woon Corp Holdings 


TSC 


Atom 11 


A17U 


Ascendas Real Estate 


Property 




Investment Trust 




C61U 


CapitaCommercial Trust 


Property 


C38U 


CapitaMall Trust Management 


Property 


P27 


Parkway Holdings 


Services 


Additional Non-atomic Stocks 


CEGL China Essence Gp 


Manufacturing 


E13 


Ellipsiz 


Services 


SDN 


Ezra Holdings 


TSC 


H80 


Hongwei Technologies 


Manufacturing 


OlO 


Orchard Parade Holdings 


Property 


Pll 


Pacific Andes (Holdings) 


Manufacturing 


S63 


Singapore Technologies 


Manufacturing 




Engineering 




Y34 


Sunshine Holdings 


Property 


5FG 


Swissco International 


TSC 



FIG. 6: The 10-atom SGX financial molecule, within which the six- 
atom financial molecule shown in Figure [5] is nested, deduced from 
the second natural boundaries in the partial hierarchical clustering 
histories of the strong financial atoms 3, 4, and 5. Compositions of 
the three additional participating weak financial atoms are shown in 
the table above, as are additional non-atomic stocks. The bonds are 
drawn with ci = 280 and C2 = 262. 

In Figure |5] we see that this six-atom financial molecule 
consists of two three-atom clusters, (3, 4, 7) and (5, 6, 8), con- 
nected by a single bond between atoms 3 and 8. Inspection of 



atomic compositions tells us that the property atom 5, banking 
atom 6, and shipping atom 8 consist mostly of local compa- 
nies, whereas the manufacturing atoms 3, 4 and 7 consist only 
of Chinese companies listed on the SGX or China-related lo- 
cal companies. Most of the non-atomic stocks are also stocks 
of Chinese or China-related companies. The larger 10-atom 
financial molecule shown in Figure |6l suggested by the sta- 
tistically more significant second boundaries in the partial hi- 
erarchical clustering histories of financial atoms 3, 4, and 5, 
tells an even more intricate story. Apart from the nested six- 
atom molecular core shown in Figure |5j we find also the par- 
ticipation of financial atoms 1, 9, 10, and 11. In this larger 
financial molecule, we find the same basic topology: a clus- 
ter of China related atoms {3,4,7,9}, and a cluster of local 
atoms {1,5,6,8,11). Apart from the direct bonding between 
financial atoms 3 and 8, the two clusters are also bonded in- 
directly through the weak bonds between atoms 3 and 8 with 
the TSC atom 10. We believe it is likely that in 2005 or 2006, 
the two clusters might actually represent two distinct financial 
molecules, which became increasingly correlated with each 
other in the period leading up to, and beyond, the end-Feb 
2007 market crash known as the Chinese Correction. 
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FIG. 7: The 13-atom molecule in the HKSE, drawn with C[ = 340 
and C2 = 306. This molecule consists of two clusters of atoms, 
{1,8, 10, 14, (6)1, and {2, 7, 9, (5), (11)), connected by direct bonding, 
as well as indirect bonding through atoms 3, 15 and 16. The first 
cluster consists almost exclusively of local atoms, while the second 
cluster are exclusively China atoms. Atoms 3 and 16 are both lo- 
cal atoms, but their constituent companies have very heavy property 
investments in China, while atom 15 is partially-local and partially 
Chinese. 

In the HKSE, we find also a single 13-atom financial 
molecule shown in Figure |7] Its molecular structure is con- 
siderably more complex than the SGX financial molecule, but 
we can still make out two molecular cores, {1, 6, 8, 10, 14) and 
{2, 5, 7, 9), as well as a group {3,11, 15, 16) of bridging atoms. 
Inspection of the atomic compositions within the first molec- 
ular core, we realized that {1,6, 8, 10, 14) are all local atoms, 
whose constituent stocks are issued by companies based in 
Hong Kong. Apart from financial atom 14, which is a bank- 
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ing and finance atom, the rest are all property atoms. The 
second molecular core {2, 5, 7, 9), on the other hand, contains 
only Chinese atoms, whose constituent stocks are issued by 
companies based in China. Unlike the local molecular core, 
atoms from the Chinese molecular core are from a variety of 
industries, ranging from banking and finance (2, 9), to oil and 
energy (7), to mining and metals (5). In the bridging group 
of financial atoms, we find a mix between local and Chinese 
atoms, primarily from the property market (3, 15, 16) and 
mining industry (11). In addition to indirect bonding of the 
two molecular cores through the bridging group of atoms, we 
also find strong direct bonds between the local atom 1 and 
Chinese atoms 2 and 9. The non-atomic stocks are also of 
mixed local and Chinese origins, representing a mixture of in- 
dustries. These are strongly correlated with nearly every con- 
stituent atom, and can most appropriately be interpreted as a 
'valence cloud' of the financial molecule. Unlike the situation 
in the SGX, many Hong Kong companies have direct business 
involvements in China, so it is perhaps not surprising to find 
strong correlations between the local and Chinese molecular 
cores. However, we believe that prior to 2006, atoms within 
the two molecular cores probably constitute several distinct 
financial molecules, whose movements become increasingly 
intertwined over 2006, culminating in the Chinese Correction 
of Feb 2007. We also believe that the industry cross section 
reflected in the HKSE financial molecule will provide impor- 
tant clues to understanding the Chinese Correction. 



2. Financial Molecules in the NYSE, LSE and TSE 

For the larger markets, we find more financial molecules, 
with a variety of sizes and compositions. One of the smallest 
financial molecule we find in the NYSE consists of the atoms 
{2, 17, 64) (shown in Figure[8ll. This financial molecule is in- 
teresting because the strong financial atoms 2 and 17 are both 
in the homebuilding market sector, whereas component stocks 
in the weak financial atom 64 are in the Investment Services 
and Investment Trusts industries. Seeing that most compo- 
nent stocks in atom 64 got into trouble in 2008, this corre- 
lational structure is perhaps a tell-tale sign of the Subprime 
Crisis. Looking for further signatures of the Subprime Crisis, 
we found the large 1 1 -atom financial molecule shown in Fig- 
ure|9l This financial molecule contains almost exclusively real 
estate and property related atoms, except for atom 36, which 
is a mutual funds atom consisting mostly of Cohen & Steers 
funds. Based on the inter-atomic correlations, as well as cor- 
relations between the constituent atoms and constituent non- 
atomic stocks, we find in this financial molecule a molecular 
core consisting of the office, retail, residential and industrial 
real estate investment trust (REIT) atoms 16, 34, 42, 48, 55, 
and 62. This strongly-correlated molecular core is surrounded 
by a inner shell of four atoms (36, 60, 76, 90), bonded directly 
to atom 16, and an outer shell consisting of atom 74, which is 
weakly bonded to every atom in the financial molecule. We 
see here that the mutual funds atom 36 is strongly correlated 
with the molecular core through atom 16, but not with the re- 
maining four shell REIT atoms. Inspecting their atomic com- 
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Code Name 


Market Sector 


Atom 64 


GS Goldman Sachs Gp Inc 


Financial 


LBI Lehman Brothers Inc 


Financial 


MER Merrill Lynch & Co Inc 


Financial 


RJF Raymond James Financial Inc Financial 


Non-Atomic Stocks 


MHO M/I Homes Inc 


Homebuilding 


NVR NVRInc 


Homebuilding 


MHK Mohawk Industries Inc 


Home Furnishing 


DY Dycom Industries Inc 


Heavy Construction 



FIG. 8: The 3-atom molecule in the NYSE, which consists of very 
strongly-bonded homebuilding atoms 2 and 17, both of which are 
weakly bonded to the investment atom 64. Four non-atomic stocks 
are correlated with atoms 2 and 17 strongly above market level corre- 
lations. Two of these, M/I Homes Inc (MHO) and NVR Inc (NVR), 
are homebuilders, while Mohawk Industries Inc (MHK) is in the 
home furnishing business. The last non-atomic stock is Dycom In- 
dustries Inc (DY), who is a provider of specialty engineering and 
construction contracting services. Because of its unique correlational 
signatures, the bonds in this molecule are not drawn using the rules 
described above, but through visual inspection of the actual correla- 
tion submatrix elements. 



positions, we also see that the four shell REIT atoms are all 
rather specialized, either in the realty needs of the healthcare, 
hospitality or outlet mall sectors. 

Another large financial molecule we found in the NYSE 
is the 14-atom oil and gas molecule shown in Figure [TO] All 
atoms in this molecule contain local stocks, and all except one 
are in the closely related oil and gas (21, 26, 32, 81), drilling 
and exploration (22, 31, 46, 73), equipment and services (24, 
44, 52, 56, 73), refining and marketing (35, 81) industries. 
The exception is atom 61, which is a coal atom. The topology 
of this molecule suggests a drilling & exploration and equip- 
ment & services molecular core, with very strong internal cor- 
relations, coupled strongly to a oil & gas molecular core, with 
strong internal correlations. Carefully analyzing the correla- 
tions this local oil & gas molecule have with other financial 
atoms in the NYSE, we find that it is strongly correlated with 
the 17-atom ADR molecule shown in Figure [TT] This ADR 
molecule consists of three distinct molecular cores. The first 
is a foreign oil & gas molecular core comprising the ADR 
atoms 3, 4, and 18. Of these three, the European oil & gas 
atoms 4 and 18 are the most strongly coiTelated with each 
other The second is a mining molecular core, which consists 
of the foreign mining atoms 6 and 10, and the US mining atom 
40. The third molecular core consists of atoms associated with 
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FIG. 9: The 11 -atom real estate financial molecule in the NYSE, 
drawn with ci = 403 and cz = 377. This financial molecule consists 
of a core of six financial atoms (6, 34, 42, 48, 55, and 62, shaded 
gray), surrounded by a inner shell of four financial atoms (36, 60, 
76, 90) bonded to financial atom 16, and an outer shell consisting of 
financial atom 74. The six core real estate investment trust (REIT) 
atoms are in the more generic office, retail, residential and industrial 
realty businesses, whereas the four shell REIT atoms are all rather 
specialized, either in the realty needs of the healthcare, hospital- 
ity or retail mall sectors. Atom 36, which is bonded only to REIT 
atom 16, is a mutual funds atom consisting largely of the Cohen & 
Steers group of realty related funds. In the above diagram, the dashed 
bond to the dashed envelope is a shorthand indicating that atom 74 is 
weakly bonded to every atom within the envelope. 




FIG. 10: The 14-atom US oil & gas financial molecule in the NYSE, 
drawn with ci = 409 and C2 = 383. This financial molecule appears 
to consist of two cores: a very tight drilling & exploration/equipment 
& services core consisting of atoms 22, 24, 31, 44, 46, 56, and a 
looser oil & gas core consisting of atoms 21, 26, 32, and 81. As 
shown using the shorthand introduced in Figure |9l the equipment 
& services atom 73 is weakly bonded to every atom in the dashed 
envelope. Surprisingly, we find the participation of the coal atom 61 
in this oil & gas molecule. 



funds for and banks in emerging markets. In particular, the 
linear subcore (49, 71, 77) are all fund atoms, while the cyclic 
subcore consists of Indian (37) and Korean (70) bank atoms. 
It is through the foreign oil & gas molecular core, and also 
partially through the mining molecular core, that we find the 
strongest coiTelations between the ADR molecule and the US 
oil & gas molecule. The remainder of the ADR molecule has 
a rather complex composition, with 15 and 51 being Euro- 
pean bank atoms, and 20 (beer), 23 (telecommunications), 58 
(steel), 91 (energy) being Brazilian atoms. It is plausible that 
in the past, these five subclusters might represent five distinct 
financial molecules that somehow became increasingly coiTe- 
lated in the period leading up to the global financial crisis. 




FIG. 11: The 17-atom ADR financial molecule in the NYSE, drawn 
with ci = 380 and C2 = 370. The dashed-dotted lines demarcate the 
distinct molecular cores: foreign oil (3, 4, 18), mining (6, 10, 40), and 
emerging market funds and banks (37, 49, 70, 71, 77). The remainder 
of the molecule comes from two groups of atoms: European banks 
(15, 51), and an assortment of Brazilian companies (20, 23, 58, 91). 
This multi-sector molecule is exceptional in the NYSE, where we 
find financial molecules coming from one, or a few closely related 
market sectors. 

In the LSE and TSE, the financial molecules found are gen- 
erally smaller than those found in the NYSE, and closer in size 
to the financial molecules found in the HKSE and SGX. For 
example, the LSE investment trusts financial molecule shown 
in Figure [T2l comprises nine financial atoms, while the TSE 
heavy industry molecule shown in Figure [T3] comprises 12 
financial atoms. The structure of the LSE investment trusts 
molecule is relatively simple, with an inner core consisting of 
the investment trusts (1, 4, 6, 7) and insurance (12) atoms, and 
an outer shell consisting of the investment trusts atoms 9, 17, 
and 19. All non-atomic stocks are from the investment trusts 
and insurance industries, and the only surprising member of 
this molecule is the aerospace and defense atom 22. The TSE 
heavy industry molecule, on the other hand, has a significantly 
more complex structure, consisting apparently of two molec- 
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ular cores, (6, 25, 33), and (1, 10, 13, 27, 30, 41, 48) as well as 
a group of bridging atoms (42, 45). Closer inspection, how- 
ever, reveal a mismatch between the bonding structure and the 
industry distribution. In this TSE financial molecule, we find 
three major market sectors: steel, iron and metal mining (6, 
10, 13, 25, 33, 41, 48), wholesale trade and distributing (1, 
30), and heavy machineries (41, 42, 45). It is interesting that 
the (10, 13, 41, 48) group of metal mining atoms are more 
strongly correlated with the trading and distributing atom 30, 
than they are with the main steel and iron group of atoms (6, 
25, 33). This is inspite of the fact that atom 48 consists of 
mostly steel production stocks. It is also surprising that the 
heavy machineries atom 27 is more strongly correlated with 
the wholesale trade atom 1 than it is with the other heavy ma- 
chineries atoms, 41, 42, and 45. Based on the industry makeup 
of this financial molecule, it is not surprising to find that its 
non-atomic stocks are mostly from the steel and heavy ma- 
chineries market sectors. However, we find it surprising that 
the non-atomic steel stocks are not strongly correlated with 
the steel and iron atoms 6, 25, and 33, but with the wholesale 
trade atom 1, the non-ferrous metal atom 13, and the heavy 
machineries atoms 27, 42, and 45. 




FIG. 12: The 9-atom investment trusts financial molecule in the LSE, 
drawn with ci = 376 and C2 = 348. This molecule consists of an 
inner core of investment trusts (1, 4, 6, 7) and insurance (12) atoms, 
and an outer shell of investment trust atoms (9, 17, 19). Surprisingly, 
the aerospace & defense atom 22 is included as a member of this 
molecule. 

While the molecular structures in the LSE and TSE hold 
some surprises, we find none of the extraordinary correla- 
tional structures seen in the NYSE, HKSE, and SGX, which 
we believe offer insights into such large-scale market events as 
the Chinese CoiTection and the Subprime Crisis. We believe 
that as a general rule, statistical signatures preceding a market 
crash can only be seen in the market(s) of origin. There should 
be little, or no symptoms at all, of such market crashes in the 
other markets that follow the market of origin into decline. 




FIG. 13: The 12-atom multi-sector financial molecule in the TSE, 
drawn with ci = 370 and C2 = 350. This molecule consists of two 
molecular cores, (6, 25, 33), and (1, 10, 13, 27, 30, 41, 48), as well 
as a group of bridging atoms (42, 45). The industries represented in 
this molecule are: steel, iron and metal mining (6, 10, 13, 25, 33, 41, 
48), wholesale trade and distributing (1, 30), and heavy machineries 
(41, 42, 45). Surprisingly, we find that: (i) the (10, 13, 41, 48) of 
metal mining atoms are more strongly correlated with the trading 
and distributing atom 30, than they are with the main steel and iron 
group of atoms (6, 25, 33), inspite of the fact that atom 48 consists 
mainly of steel production stocks; (ii) the heavy machineries atom 
27 is more strongly correlated with the wholesale trade atom 1 than 
it is with the other heavy machineries atoms 41, 42, and 45; (iii) the 
non-atomic stocks are mostly from the steel and heavy machineries 
sectors, but have stronger correlations with the wholesale trade atom 
1, the non-ferrous metal atom 33, and the heavy machineries atom 
27, 42, and 45, and not with the steel and iron atoms 6, 25, and 33. 



However, because of strong coupling between global finan- 
cial markets, we believe it might be possible to see statistical 
signatures of 'aftershocks' that play out simultaneously in all 
markets after the initial market crash. 



C. Financial Supermolecules 

In all markets, we find further self-organization at the level 
of hundreds of stocks. In markets where we find more than 
one financial molecule (NYSE, LSE, and TSE), these can be 
interpreted as bound states of financial molecules, and thus 
we call them financial supermolecules. From a finance point 
of view, the different market sectors in a developed economy 
are subjected to differing degrees of regulation, and as such, 
it is perhaps not surprising to find all stocks within the same 
market sector to be more coiTelated with each other, than with 
stocks from another market sector. This naturally suggests 
that the dynamics of an entire market sector self-organizes 
into a financial supermolecule [isl [T^ [T3l. From the top 
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panel in Figure [141 we find three natural boundaries for a fi- 
nancial supermolecule in the NYSE, with the second bound- 
ary at cluster size of 176 stocks being the most significant 
statistically. If we analyze the composition of this financial 
supermolecule, starting from any of the natural boundaries, 
we find that this supermolecule is primarily the bound state 
between the US oil & gas molecule (shown in Figure [TOli and 
the ADR molecule (shown in Figure fTTTi. If we use the third 
boundary as our definition of the supermolecule, then we find 
it including other US atoms from the oil & gas (67, 105), steel 
(50, 95), emerging market funds (41, 45), industrial machin- 
ery (102), aerospace and defense (89), and construction mate- 
rials (106) sectors. Most of these peripheral atoms are closely 
related to the US oil & gas molecule, or to the ADR molecule. 
From the middle panel in Figure [141 we find a single, statis- 
tically significant, natural boundary for a LSE supermolecule 
comprising nearly all the financial atoms in the market. This 
supermolecule is predominantly financials (atoms 1, 3, 4, 6, 
7, 9, 12, 13, 14, 17, 19, many of which are components of the 
LSE investment trusts molecule shown in Figure [T2b in char- 
acter, and the excluded atoms — atom 2 (oil and gas), atom 
11 (foreign consumer electronics), atom 15 (foreign oil and 
gas), and atom 21 (restaurants) — are all from market sectors 
distant from the finance sector. 

Of course, in the TSE, the financial molecules themselves 
do not respect sector boundaries, probably because of exten- 
sive cross-ownership practiced in this market |48]. There- 
fore, the financial supermolecules do not represent any par- 
ticular market sector, but represent instead some kind of 
'eigen' -trader, which is a collection of institutional and re- 
tail traders whose trading strategies are largely mutually rein- 
forcing within the two-year time frame investigated. Different 
'eigen' -traders probably have different concentrations in their 
portfolio of stocks, and are therefore approximately 'orthogo- 
nal' in their impacts on the market. We suspect this 'orthog- 
onality' of trading strategies lead to financial supermolecules 
being only weakly correlated, and thus an emergent super- 
molecular correlation level that is significantly lower than the 
molecular correlation levels. From the bottom panel in Fig- 
ure[T4l we find a natural boundary for a large financial super- 
molecule in the TSE. This supermolecule incorporates a large 
fraction of the 52 financial atoms found in the TSE, and the 1 1 
financial atoms that are not incorporated are from the banks 
(5, 15, 34, 39, 47), insurance (2), other financing business 
(7), marine transportation (11, 22), utilities (24), and atom 
20, which comprises the stocks issued by Toyota and Honda, 
Japan's largest automobile makers. The exclusion of banks 
from this TSE supermolecule, however, is limited to regional 
banks, as atom 8, which comprises large diversified banks, is 
included in the supermolecule. 

In small and less stringently regulated markets like the 
HKSE and SGX, where we find only a single financial 
molecule, market sector boundaries are again not statistically 
meaningful. For these small markets, we again find financial 
supermolecules consisting of over 100 stocks. These com- 
prises the sole financial molecule, other financial atoms, and 
various more actively traded non-atomic stocks. The super- 
molecular correlation level is lower than for the large markets. 
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FIG. 14: Complete-link partial hierarchical clustering histories 
showing statistical signatures of self-organization at the financial su- 
permolecule level in large mature markets, starting from (top to bot- 
tom): atom 22 of the NYSE, atom 7 of the LSE, and atom 13 of 
the TSE. The natural boundaries of the financial supermolecules are 
indicated by the black triangles. 



and the separation of correlation levels is between actively- 
traded stocks and illiquid stocks. Again, this is the statistical 
signature left by the 'eigen' -trader, which is the effectively 
the whole market in these small markets. From Figure [151 we 
find one natural boundary for the sole HKSE financial super- 
molecule, and two natural boundaries for the SGX financial 
supermolecule. All financial atoms in the HKSE and SGX are 
found to participate in the respective supermolecules. In fact, 
all financial atoms in the SGX are already found to included 
by the first natural boundary, and so the second natural bound- 
ary might indicate a further separation of correlation levels 
caused by different Uquidity levels in the market. 



III. SUMMARY AND DISCUSSIONS 

In conclusion, we considered the phenomenon of separation 
of dynamical time scales, which arises generically as a result 
of self-organization in a complex system with a large number 
of interacting microscopic variables. We described how these 
separated time scales, and their associated dynamical struc- 
tures, can be learned statistically, by examining short-time 
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FIG. 15: Statistical signatures for self-organization at the financial 
supermolecule level, within the complete-link partial hierarchical 
clustering histories of small markets: (top) starting from atom 7 of 
the HKSE, and (bottom) starting from atom 1 of the SGX. The nat- 
ural boundaries of the financial supermolecules are indicated by the 
black triangles. 



correlational statistics between the microscopic variables over 
a longer time, and identifying separated correlation levels at 
the long time scale. Arguing that only robust dynamical struc- 
tures are physically meaningful, we explained why we need 
only look out for separated long-time correlation levels that 
are insensitive to details of the short-time and long-time statis- 
tics examined. After recasting the problem of finding a small 
number of elfective variables (the self-organized dynamical 
structures) from a large number of interacting microscopic 
variables as a statistical learning problem, we moved on to de- 
scribe our two-time-scale statistical clustering method. In this 
method, we would first cluster the time series of the micro- 
scopic variables on a short time scale. Thereafter, we would 
construct a structurally-simple long-time correlation matrix 
by counting the number of times pairs of microscopic vari- 
ables are assigned to the same short-time clusters over the long 
time scale. We then developed the method of partial hierarchi- 
cal clustering to automatically and systematically extract the 
separated correlation levels, and their associated collections of 
microscopic variables, from the long-time correlation matrix. 

Using the two-time-scale statistical clustering method to 
analyze the five equity markets shown in Table J] over the 
two-year period from 2006 to 2007, we found in all markets 
separated correlation levels that point to self-organization at 
several hierarchical levels. We call the separated correlation 
levels atomic, molecular, and supermolecular correlation lev- 
els, and the effective dynamical variables they are associated 
with financial atoms, financial molecules, and financial su- 
permolecules. In all markets, financial atoms are found to 
consist of around 10 stocks, having internal correlations far 
above the background correlation level caused by market-level 



drifts. Some financial atoms are trivial, consisting of two or 
more stocks issued by the same company, while others have 
compositions expected of companies competing directly with 
each other within the same industry. Quite a number of fi- 
nancial atoms, however, consist surprisingly of stocks from 
very different industries. We find that these surprising atomic 
compositions can only be explained in terms of the ownership 
profiles and broader business conflicts and mutual interests of 
the constituent stocks. In the more globalized NYSE and LSE, 
we also find foreign atoms, which are part of financial atoms 
in their home markets. 

In the smaller markets (HKSE and SGX), we find a single 
financial molecule made up of around 10 financial atoms. In 
the larger markets (NYSE, LSE, and TSE), we find many fi- 
nancial molecules, with sizes ranging from several to around 
20 atoms. In the NYSE and LSE, financial molecules com- 
prise atoms from a small number of closely related mar- 
ket sectors, whereas in the TSE, HKSE, and SGX, financial 
molecules do not respect market sector boundaries, sporting 
constituent atoms from distant sectors. We suspect that the 
former reflects the stringent market regulations in place in the 
NYSE and LSE, while the latter is a consequence of exten- 
sive cross-ownership in the TSE, and dominance by a small 
number of multi-industry holding companies in the HKSE 
and SGX. In the NYSE and LSE, market sector boundaries 
continue to be more or less respected by their financial su- 
permolecules, and as such, market sectors can serve as high- 
level effective variables for understanding the dynamics in 
these markets. In contrast, the high-level effective variables 
in the TSE, HKSE, and SGX are multi-sector financial su- 
permolecules associated with 'eigen' collections of institu- 
tional and retail traders with roughly 'orthogonal' market im- 
pacts. Close examinations of the correlational structures of 
the NYSE, HKSE, and SGX financial molecules reveal very 
interesting signatures of such large market events as the Sub- 
prime Crisis and the Chinese Correction. No extraordinary 
structures were seen in the LSE and TSE financial molecules. 
This led us to conclude that precursor signatures of large mar- 
ket events can only be detected in their markets of origin. 

It is important to point out here that financial atoms and 
molecules are not merely curiosity items. The robust identi- 
fication of financial atoms and molecules forms the first step 
in our program to understand real financial markets. Once the 
financial atoms and molecules are found, we will use them as 
effective variables to build effective dynamical models. Un- 
der normal market conditions, we will then apply understand- 
ing derived from this effective modeling to risk analysis and 
portfolio management. Portfolio optimization schemes based 
on the minimal spanning tree analysis of market correlations 
have been developed IUtI l49ll . but we believe it is possible 
to bring the state of the art to a whole new level. For exam- 
ple, we believe current hedging strategies used by large fund 
managers must be revised, in line with the structures of finan- 
cial atoms and molecules found in the respective markets. In 
particular, when designing a portfolio, it is important to ob- 
serve that fluctuations in the price-weighted mean (the center 
of mass) of large financial molecules are suppressed relative to 
market-level fluctuations, because of the combined inertia of 



all their strongly-correlated constituent stocks. On the other 
hand, we need to be more careful investing in small finan- 
cial molecules, because they are 'lighter', and are thus more 
susceptible to driving by market 'forces'. More importantly, 
by making trading decisions at the financial molecule level, 
it is possible to unambiguously decompose the risk any given 
portfolio is exposed to, into a systematic risk component as- 
sociated with the effectively deterministic dynamics of the fi- 
nancial molecules, and a stochastic risk component associated 
with random, unpredictable shocks experienced by the market 
on a whole. Our findings suggest that the systematic risk can 
be further reduced by taking into account the effective dynam- 
ics of the financial supermolecules. 

Our analyses and findings have even more serious implica- 
tions to understanding and coping with financial crises. As 
suggested by previous works L5Clil5lll52ll53il . and also judg- 
ing by the intriguing results seen in the NYSE, HKSE, and 
SGX, we believe a market becomes extremely susceptible to 
crashes when a significant fraction of stocks gets lock into a 
single financial supermolecule. If we can determine the crit- 
ical size of such a market-spanning supermolecule, and also 
continuously track the sizes of growing supermolecules, we 
might be able to either forecast the onset of a financial crisis 
one or two months in advance, or perhaps even avert the crisis 
entirely. To this end, we are currently looking at the Chinese 
Correction on a shorter time scale. By tracking the dynam- 
ics of financial molecules in the SGX, and the evolution of 
their molecular structures from month to month over 2006 and 
2007, we aim to develop a dynamical picture of bond forma- 
tion prior to, and bond breaking after the market crash. We 
believe such a 'chemical' picture will offer important clues 
to understanding the end-February 2007 Chinese Correction, 
which probably catalyzed the Subprime Crisis that surfaced a 
few months later 
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APPENDIX A: LISTS OF STRONG FINANCIAL ATOMS 



1. New York Stock Exchange 



Atom 


Code 


Name 


Market Sector 


Remarks 


1 


VIA 
VIA.B 


Viacom A 
Viacom B 


Services 
Services 




2 


LEN 
LEN.B 


Lennar A 
Lennar B 


Industrial Goods 
Industrial Goods 


Residential Con- 
struction 

Residential Con- 
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Atom 


Code 


Name 


Market Sector 


Remarks 


3 


PBR 


Petroleo 
Brasileiro SA 


Basic Materials 


ADR 




PBR.A 


Petroleo 

r>i <jM it:ii u 


Basic Materials 


ADR 


4 


RDS.A 


Royal Dutch 
Shell A 


Basic Materials 


ADR 




RDS.B 


Royal Dutch 
Shell B 


Basic Materials 


ADR 


5 


CBS 


CBS 








CBS.A 


CBS 


Services 




6 


BHP 


BHP Billiton 


Basic Materials 


ADR 




BBL 


BHP Billiton 


Basic Materials 


ADR 




RTF 


Rio Tinto pic 


Basic Materials 


ADR 


7 


LAZ 


Lazard 


Financial 


Asset Manage- 
ment 




LDZ 


Lazard 6.625% 


Financial 


Equity Security 
Units 


8 


UN 


Unilever NV 


Consumer Goods 


ADR 




UL 


Unilever pic 


Consumer Goods 


ADR 


9 


CCL 


Carnival Corp 


Services 


General Enter- 
tainment 




CUK 


Carnival pic 


Services 


ADR 




RCL 


Royal Canibean 
Cruises 


Services 


General 
Entertainment 


10 


RIO 


Vale 


Basic Materials 


ADR 




RIO- 


Vale 


Basic Materials 


ADR 


11 


FCE.A 


Forest City Enter- 
prises A 


Financial 


Property Man- 
agement 




FCE.B 


Forest City Enter- 
prises B 


Financial 


Property Man- 
agement 


12 


NWS 


News Corp 


Services 






NWS. A 


News Corp A 


Services 




13 


ETR 


Entergy Corp 


Utilities 


Electric Utilities 




ETR.PRA 


Entergy Corp 
7.625% 


Utilities 


Equity Security 
Units 


14 


ENL 


Reed Elsevier 

i> V 


Services 


ADR 




RUK 


Reed Elsevier pic 


Services 


ADR 


15 


BBD 


Banco Bradesco 
SA 


Financial 


ADR 




ITU 


Banco Itau Hldg 
Financeira SA 


Financial 


ADR 


16 


DDR 


Developers Di- 
versified Realty 
Corp 


Financial 


Retail REIT 




REG 


Regency Centers 
Corp 


Financial 


Retail REIT 




MAC 


The Macerich Co 


Financial 


Retail REIT 




SPG 


Simon Property 
Gp Inc 


Financial 


Retail REIT 




EQR 


Equity Residen- 
tial 


Financial 


Residential REIT 




GGP 


General Growth 
Properties Inc 


Financial 


Retail REIT 
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Atom 


Code 


Name 


Market Sector 


Remarks 


17 


KBH 


KB Home 


Industrial Goods 


Residential Con- 
struction 




PHM 


Pulte Homes Inc 


Industrial Goods 


Residential Con- 
struction 




CTX 


Centex Corp 


Industrial Goods 


Residential Con- 
struction 




MDC 


MDC Hldg Inc 


Industrial Goods 


Residential Con- 
struction 




TOL 


Toll Brothers Inc 


Industrial Goods 


Residential Con- 
struction 




UriL 


UK riorion inc 


Industrial Goods 


Residential Con- 
struction 




RYL 


The Ryland Gp 
Inc 


Industrial Goods 


Residential Con- 
struction 




HOV 


Hovnanian Enter- 
prises Inc 


Industrial Goods 


Residential Con- 
struction 




SPF 


Standard Pacific 
Corp 


Industrial Goods 


Residential Con- 
struction 




MTH 


Meritage Homes 
Corp 


Industrial Goods 


Residential Con- 
struction 




BHS 


Brookfield 
Homes Corp 


Industrial Goods 


Residential Con- 
struction 


18 




Fni SPA 


jjojMv ivirtit-iitua 


ADR 




TOT 


Total SA 


Basic Materials 


ADR 




BP 


BP pic 


Basic Materials 


ADR 




REP 


Repsol YPF SA 


Basic Materials 


ADR 


19 


MET 


MetLife Inc 


Financial 


Life Assurance 






MetLife 6.375% 


Financial 


Equity Security 
Units 


20 


ABV 


Companhia de 
Bebidas das 
Americas 


Consumer Goods 


ADR 




ABV.C Companhia de Consumer Goods 


ADR 






Bebidas das 










Americas 






21 


CVX 


Chevron Corp 


Basic Materials 


Integrated Oil & 
Gas 




XOM 


Exxon Mobile 
Corp 


Basic Materials 


Integrated Oil & 
Gas 




PEO 


Petroleum & Re- 
sources Corp 


Financial 


Equity Closed- 
End Fund 




COP 


ConocoPhilUps 


Basic Materials 


Gas 




OXY 


Occidental 
Petroleum Corp 


Basic Materials 


Independent Oil 
&Gas 




MRO 


Marathon Oil 
Corp 


Basic Materials 


Oil & Gas Refin- 
ing & Marketing 


22 


ESV 


Ensco Intl Inc 


Basic Materials 


Oil & Gas 
Drilling & Ex- 
ploration 




NE 


Noble Corp 


Basic Materials 


Oil & Gas 
Drilling & Ex- 
ploration 




PDE 


Pride Intl Inc 


Basic Materials 


Oil & Gas 
Drilling & Ex- 
ploration 


23 


BRP 


Brasil Telecom 
Participacoes SA 


Technology 


ADR 




BTM 


Brasil Telecom 
SA 


Technology 


ADR 




TBH 


Brazilian Tele- 
com HOLDRS 


Technology 


ADR 


24 


SLB 


Schlumberger 


Basic Materials 


Oil & Gas Equip- 
ment & Services 




sn 


Smith Intl Inc 


Basic Materials 


Oil & Gas Equip- 
ment & Services 



Atom 


Code 


Name 


Market Sector 


Remarks 


25 


KV.A 


KV Pharma A 


Healthcare 


Drug DeUvery 




KV.B 


KV Pharma B 


Healthcare 


Drug Delivery 



26 



CNQ 
SU 
ECA 
TLM 
SFY 
CHK 
NXY 
EOG 
PCZ 



Canadian Natural 
Resources 
Suncor Energy 
Inc 

EnCana Corp 

Talisman Energy 
Inc 

Swift Energy Co 

Chesapeake 
Energy Corp 
Nexen Inc 

EOG Resources 
Inc 

Petro-Canada 



Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 
Basic Materials 



Independent Oil 
&Gas 

Independent Oil 
&Gas 

Independent Oil 
&Gas 

Independent Oil 
& Gas 

Independent Oil 
&Gas 

Independent Oil 
& Gas 

Independent Oil 
&Gas 

Independent Oil 

& Gas 

Oil & Gas Refin- 



27 


MIR 

MIR.WSA 
MIR.WSB 


Mirant Corp 

Mirant Corp 
Mirant Corp 


Utilities 
Utilities 
Utilities 






28 


HEI 
HEI.A 


HEICO Corp 
HEICO Corp 


Industrial Goods 
Industrial Goods 


Aerospace i 
fense 

Aerospace i 
fense 


iDe- 
iDe- 


29 


BBI 
BBI.B 


Blockbuster Inc 
A 

Blockbuster Inc 
B 


Services 
Services 







30 


BNI 


Burlington 


Services 


Raih-oads 






Northern Santa 










Fe Corp 








UNP 


Union Pacific 


Services 


Raihoads 






Corp 








CSX 


CSX Corp 


Services 


Railroads 




NSC 


Norfolk Southern 


Services 


Railroads 






Corp 






2. London Stock Exchange 


Atom 


Code 


Name 


Market Sector 


Remarks 


1 


SDR 


Schroders 


Financials 


Investment 










Services 




SDRt 


Schroders 


Financials 


Investment 










Services 


2 


RDSa 


Royal Dutch 


Energy 


Integrated Oil & 






Shell 




Gas 




RDSb 


Royal Dutch 


Energy 


Integrated Oil & 






Shell 




Gas 


3 


HMSO 


Hammerson 


Financials 


REIT 




m 


Liberty Intl 


Financials 


REIT 




LAND 


Land Securities 


Financials 


REIT 






Gp 








BXTN 


Brixton 


Financials 


Real Estate Ops 


4 


SCIN 


Scottish Invest- 


Financials 


Investment Trusts 






ment Trust 








WTAN Witan Investment 


Financials 


Investment Trusts 






Tnist 






5 


AAL 


Anglo American 


Basic Materials 


Mining & Metals 




BLT 


BHP Billiton 


Basic Materials 


Mining & Metals 




RIO 


Rio Tinto 


Basic Materials Mining & Metals 
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Code 




iVltlllvCl OCULtJl 




6 


MNKS 


Monks Invest- 
ment trust 


Financials 


Investment Trusts 




SMT 


Scottish Mort- 
gage Investment 
Trust 


Fmancials 


Investment Trusts 


7 


EDIN 


Edinburgh Invest- 
ment Trust 


Financials 


Investment Trusts 




MRCM Mercantile In- 


Financials 


Investment Trusts 






VC7M 1 1 it:m 1 1 iiM 






8 


ANTO 


Antofagasta 


Basic Materials 


Mining & Metals 




VED 


Vedanta Re- 
sources 


Basic Materials 


Mining & Metals 


9 


ATST 


Alliance Trust 


Financials 


Investment Trusts 




FEV 


Fidelity Euro- 
pean Values 


Financials 


Investment Trusts 




RCP 


RIT Capital Part- 
ners 


Financial 


Investment Trusts 




BDEV 


Barratt Develop- 
ments 


Cyclical Con- 
sumer Goods & 
Services 


Homebuilding 




PSN 


Persimmon 


Cyclical Con- 

sUlllCI vJOUUa Oc 

Services 


Homebuilding 




RDW 


Redrew 


Cyclical Con- 

sUlllCI vJOUUa Oc 

Services 


Homebuilding 


11 


0593q 


Samsung Elec- 


Cyclical Con- 


Consumer Elec- 






tronics 


sumer Goods & 
Services 


tronics 




0593xq Samsung Elec- 


Cyclical Con- 


Consumer Elec- 






tronics 


sumer CjoocIs & 
Services 


tronics 




AV 


Aviva 


Financials 


Insurance - Mul- 
tiline 




PRU 


Prudential 




& Health 




LGEN 


Legal and Gen- 
eral Gp 


Financials 


Insurance - ^4ul- 
tiline 




FP 


Friends Provident 


Financials 


Insurance - Life 
& Health 




STAN 


Standard Char- 
tered 


Financials 


Banks 




FRCL 


Foreign & Colo- 
nial Investment 
Tnist 


Financials 


Investment Trusts 



3. Tokyo Stock Exchange 



Atom Code Name Market Sector Remarks 

1 8001 Itochu Corp Wholesale Trade Section 1 

8002 Marubeni Corp Wholesale Trade Section 1 

8031 Mitsui & Co Wholesale Trade Section 1 

8053 Sumitomo Corp Wholesale Trade Section 1 

8058 Mitsubishi Corp Wholesale Trade Section 1 



Atom Code 


Name 


Market Sector 


Remarks 


2 8725 


Mitsui Sumitomo 

Tn CI iTiiTir*f* 1 tI^ 

Hldg 


Insurance 


Section 1 


8754 


NIPPONKOA In- 
surance Co. 


Insurance 


Section 1 


8755 


Sompo Japan In- 
surance Inc. 


Insurance 


Section 1 


8759 


Nissay Dowa 
General Insur- 
ance Co. 


Insurance 


Section 1 


8761 


rtlUl Ilia Ul oLICC 

Co 


lIlaUlollL'C 


OCCUUll 1 


8763 


Fuji Fire & Ma- 
rine Insurance Co 


Insurance 


Section 1 


3 8601 


Daiwa Securities 
Gp Inc 


Securities 


Section 1 


8604 


Nomura Hldg Inc 


Securities 


Section 1 


ODUO 


Shinko Securities 
Co 


Securities 


Section 1 


8607 


Seciuities Co 


OCCUllLlCo 


OCL'LlUll 1 


8609 


wJ4-aa«ll OCCUll 

ties Gp Inc 


occuiiiica 


OCCllUll 1 


8613 


Marusan Securi- 
ties Co 


Securities 


Section 1 


8614 


Toyo Securities 
Co 


Securities 


Section 1 


8616 


Tokai Tokyo Se- 
curities Co 


Securities 


Section 1 


ouzz 


Mito Securities 
Co 


Securities 


Section 1 


8624 


Tchiyoshi Securi- 
ties Co 


Securities 


Section 1 


4 8801 


Mitsui Fudosan 
Co 


Real Estate 


Section 1 


8802 


Mitsubishi Estate 
Co 


Keai Estate 


oection 1 


8815 TokyuLandCo 


Real Estate 


Section 1 


8830 Sumitomo Realty 


Real Estate 


Section 1 




& Development 








Co 






5 8356 


The Juroku Bank 


Banks 


Section 1 


8360 


The Yamanashi 
Chuo Bank 


Banks 


Section 1 


8362 


The Fukui Bank 


Banks 


Section 1 


8368 


The Hyakugo 
Bank 


Banks 


Section 1 


8381 


The San-in Godo 
Bank 


Banks 


Section 1 


8382 


The Chugoku 
Bank 


Banks 


Section 1 


8386 


The Hyakujushi 
Bank 


Banks 


Section 1 


8388 


The Awa Bank 


Banks 


Section 1 


8390 


The Kagoshima 
Bank 


Banks 


Section 1 


8394 


The Higo Bank 


Banks 


Section 1 


6 5401 


Nippon Steel 
Corp 


Steel & Iron 


Section 1 


5405 


Sumitomo Metal 
Industries 


Steel & Iron 


Section 1 


5406 


Kobe Steel 


Steel & Iron 


Section 1 


5411 


JFE Hldg Inc 


Steel & Iron 


Section 1 
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Atom 


Code 


Name 


Market Sector 


Remarks 


7 


8253 


Credit Saison Co 


Other Financing 
Business 


Section 1 




8515 


Aiful Corp 


Other Financing 
Business 


Section 1 




8564 


Takefuji Corp 


Other Financing 
Business 


Section 1 




8572 


Acorn Co 


Other Financing 
Business 


Section 1 




8574 


Promise Co 


Other Financing 
Business 


Section 1 




8583 


Mitsubishi UFJ Other Financing Section 1 






Financial Gp Inc 


Business 




8 


8308 


Resona Hldg Inc 


Banks 


Section 1 




8309 


Chuo Mitsui 
Trust Hldg Inc 


Banks 


Section 1 




8403 


The Sumitomo 
Trust & Banking 
Co 


Banks 


Section 1 




8404 


Mizuho Trust & 
Banldng Co 


Banks 


Section 1 




8411 


Mizuho Financial 
Gp Inc 


Banks 


Section 1 


9 


4321 


Kenedix Inc 


Services 


Section 1 




8804 Tokyo Tatemono 


Real Estate 


Section 1 






Co 








8868 


Urban 


Real Estate 


Section 1 




8888 


Creed Co 


Real Estate 


Section 1 




8902 


Pacific Hldg Inc 


Real Estate 


Section 1 


10 


5726 


Osaka Titanium 
Technologies Co 


Non-Ferrous 
Metals 


Section 1 




5727 


Toho Titanium Non-Ferrous 


Section 1 






Co 


Metals 




11 


9104 


Mitsui O.S.K. Marine Trans- 


Section 1 






Lines 


portation 






9107 


Kawasaki Kisen Marine Trans- 


Section 1 






Kaisha 


portation 




12 


1801 


Taisei Corp 


Construction 


Section 1 




1802 


Obayashi Corp 


Construction 


Section 1 




1803 


Shimizu Corp 


Construction 


Section 1 




1812 


Kajima Corp 


Construction 


Section 1 


13 


5711 


Mitsubishi Mate- 
rials Corp 


Non-Ferrous 
Metals 


Section 1 




5713 


Sumitomo Metal 
Mining Co 


Non-Ferrous 

Metals 


Section 1 




5771 


Mitsubishi Shind 






14 


4751 


CyberAgent Inc 




Mothers 




4788 Cyber Communi- 




Mothers 






cations Inc 







*This instrument code no longer exist on the TSE, but may be related to 
Urban Life Co, in the real estate sales, brokerage, and leasing business, 
traded on the Osaka Securities Exchange as 8851. 



4. Hong Kong Stock Exchange 



Atom Code 


Name 


Market Sector 


Remarks 


1 
1 


0001 


Cheung Kong 


PrnnprtipQ 








Hldg 








0011 


Hang Seng Bank 


Financf* 






0013 


Hutchison 


Consolidated En- 








Whampoa 


terprises 






0016 


Sun Hung Kai 


Properties 








Properties 








0066 


MTR Corp 


Utilities 






0941 


China Mobile 


Consolidated En- 










terprises 




2 


0762 


China Unicom 


Consolidated En- 








(HK) 


terprises 






2318 


Ping An Insur- 


Finance 








ance (Gp) Co of 










China 








2628 


China Life Insur- 


Finance 








ance Co 






3 


0688 


China Over- 


Properties 








seas Land & 










Investment 








1109 


China Resources 


Properties 








Land 








3383 Agile Properties 


Properties 








Hldg 






4 


2899 Zijin Mining Gp 


Miscellaneous 








Co 








3330 


Lingbao Gold Co 


Miscellaneous 




5 


0323 


Maanshan Iron & 


Industries 








Steel Co 








0347 


Angang Steel Co 


Industries 




6 


0019 


Swire Pacific A 


ConsoUdated En- 










terprises 






0087 


Swire Pacific B 


Consolidated En- 










terprises 




7 


0857 


PetroChina Co 


Industries 


Oil & Natural 










Gas 




0883 


CNOOC 


Industries 


Oil & Natural 










Gas 



18 



5. Singapore Stock Exchange 



Atom Code 


JName 


Market Sector Remarks 


1 


Z74 


Singtel 


TSC 






Singtel 10 


TSC 






Singtel 100 


TSC 


2 


C6L 


Singapore Air- 


TSC 






lines 








Singapore Air- 


TSC 






lines 200 




3 


C56 


Celestial Nutri- 


Manufacturing 






foods 






C86 


China Sun Bio- 


Manufacturing 






Chem Tech Gp 








Co 




4 


C68 


Mrrach Energy 


Services 




W81 


oKy L^nina 


Services 






Pftfnlfiiin Sa'pc 






F33 


Ferrochina 


A4anutacturing 




E90 


China Sky Chem- 


Manufacturing 






ical Fibre Co 




5 


C31 


CapitaLand 


Properties 




C09 


City Development 


Properties 


6 


D05 


DBS Gp Hldg 


Finance 




Ull 


United Overseas 


Finance 






Bank 






039 


Oversea-Chinese 


Finance 






Banking Corp 






W05 


Wing Tai Hldg 


Properties 



[1] All transactions through electronic order books are automati- 
cally logged, and subsequently incorporated into various high- 
frequency financial databases. Some over-the-counter (OTC) 
transactions are also logged, but these do not always make their 
way into the main databases. 

[2] Downloaded using the MATLAB script 
|http : //www.mathworks . com/matl abcentral /fileex changi 
written by Josiah Renfree. 

[3] In principle, we can sacrifice some statistical significance, and 
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